A fast procedure for calculating importance weights in bootstrap sampling

نویسندگان

  • Hua Zhou
  • Kenneth Lange
چکیده

Importance sampling is an efficient strategy for reducing the variance of certain bootstrap estimates. It has found wide applications in bootstrap quantile estimation, proportional hazards regression, bootstrap confidence interval estimation, and other problems. Although estimation of the optimal sampling weights is a special case of convex programming, generic optimization methods are frustratingly slow on problems with large numbers of observations. For instance, interior point and adaptive barrier methods must cope with forming, storing, and inverting the Hessian of the objective function. In this paper, we present an efficient procedure for calculating the optimal importance weights and compare its performance to standard optimization methods on a representative data set. The procedure combines several potent ideas for large scale optimization.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computation of Weighted Functional Statistics Using Software That Does Not Support Weights Computation of Weighted Functional Statistics Using Software That Does Not Support Weights

We discuss methods for calculating statistics for weighted samples using software that does not support weights. Such samples arise in survey sampling with unequal probabilities, importance sampling, and bootstrap tilting. The software might not support weights for reasons of eeciency, simplicity, or because it was quicker to write the software without supporting weights. We discuss several tec...

متن کامل

Fast, Exact Bootstrap Principal Component Analysis for p > 1 million.

Many have suggested a bootstrap procedure for estimating the sampling variability of principal component analysis (PCA) results. However, when the number of measurements per subject (p) is much larger than the number of subjects (n), calculating and storing the leading principal components from each bootstrap sample can be computationally infeasible. To address this, we outline methods for fast...

متن کامل

The Biased-bootstrap for Gmm Models

In this talk, I present some theoretical and empirical properties of the uniform and biased-bootstrap for generalized method of moments (GMM) models. The version of the biased-bootstrap used in this paper is a form of weighted bootstrap with weights chosen to satisfy some constraints imposed by the model. A typical biased-bootstrap resample is obtained by resampling from a member within a pseud...

متن کامل

A New Robust Bootstrap Algorithm for the Assessment of Common Set of Weights in Performance Analysis

The performance of the units is defined as the ratio of the weighted sum of outputs to the weighted sum of inputs. These weights can be determined by data envelopment analysis (DEA) models. The inputs and outputs of the related (Decision Making Unit) DMU are assessed by a set of the weights obtained via DEA for each DMU. In addition, the weights are not generally common, but rather, they are ve...

متن کامل

SUGI 27: Use of the ROC Curve and the Bootstrap in Comparing Weighted Logistic Regression Models

In analyzing data from a survey, researchers often need to compare the effectiveness of several logistic regression models. The receiver operating characteristic curve offers one way to measure effectiveness of prediction, by calculating the area under the curve (AUC). We present a SAS macro for calculating AUC that takes the survey weights into account. For comparing logistic regression models...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational statistics & data analysis

دوره 55 1  شماره 

صفحات  -

تاریخ انتشار 2011